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The estimation of the total of an attribute defined over a contin- 
uous planar domain is required in many applied settings, such as the 
estimation of canopy coverage in the Monterano Nature Reserve in 
Italy. If the design-based approach is considered, the scheme for the 
placement of the sample sites over the domain is fundamental in or- 
CLj ' der to implement the survey. In real situations, a commonly adopted 

.^^ ^ scheme is based on partitioning the domain into suitable strata, in 

• . such a way that a single sample site is uniformly placed (i.e., selected 

jrt ' with uniform probability density) in each stratum and sample sites 

are independently located. Under mild conditions on the function 

representing the target attribute, it is shown that this scheme gives 

rise to an unbiased spatial total estimator which is "superefficient" 

with respect to the estimator based on the uniform placement of inde- 

^ ' pendent sample sites over the domain. In addition, the large-sample 

r*-^ I normality of the estimator is proven and variance estimation issues 

^5^ ■ are discussed. 

^. 

^r^ I 1. Introduction. Applied scientists frequently deal with attributes de- 

^D ' fined on continuous spatial domains. In this framework, if the design-based 

^^ [ approach is assumed, the target attribute may be expressed as a fixed 

bounded function y taking values on the study region A (a suitable sub- 
set of the plane). In the simplest case, y{u) may represent the value of the 
attribute at u £ A. As an example, in an environmental survey, y{u) could 
be the air-borne pollutant level at the sample site u on a landscape. In 
a more structured setting, y{u) may also describe the "attribute density" 
arising from the selected spatial sampling design [this topic is extensively 
considered and explained in Chapter 10 of Gregoire and Valentine (2008)]. 
As an example, by supposing a fixed-radius circular plot sampling in a forest 
survey, y{u) could represent the number of trees lying in the plot centered 
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at the sample site u [up to a known proportionality constant; see Gregoire 
and Valentine (2008), pages 328-332]. In this case, under the design-based 
approach, the population universe is constituted by a continuum (ideally by 
the noncountable set of sample sites on ^4) and the inference is actually car- 
ried out by assuming the so-called "continuous-population" paradigm. This 
approach has been extensively considered in recent years on the basis of the 
seminal papers by de Gruijter and ter Braak (1990), Cordy (1993) and Brus 
and de Gruijter (1997). 

In the described framework, the estimation goal is usually focused on the 
spatial total, that is. 



(1.1) T= / y{u)du 

J A 

[see, e.g., Stevens (1997) and Chapter 10 of Gregoire and Valentine (2008)]. 
Indeed, as emphasized by Stevens (1997), the integral representation in (1.1) 
embraces a general family of population parameters, such as means, propor- 
tions or distribution functions. In order to estimate T, the key problem of the 
design-based approach is the selection of an appropriate sampling strategy. 
As usual, it is assumed that the sampling strategy includes the joint selec- 
tion of a suitable estimator and the corresponding scheme for the placement 
of the n sample sites on the study region A. Since an integral representation 
for T holds, it is quite evident that the estimation problem may be rephrased 
in terms of the Monte Carlo integration theory. Interestingly, known Monte 
Carlo integration strategies are equivalent to the sampling strategies which 
are commonly adopted in environmental and ecological studies [Barabesi 
(2003, 2007), Gregoire and Valentine (2008), page 327]. Similar Monte Carlo 
integration approaches to parameter estimation occur in very different re- 
search areas, such as in stereology [see, e.g., the monograph by Baddeley 
and Jensen (2005)] or in computer graphics [see, e.g., Agarwal et al. (2003)]. 

The basic reference sampling scheme for selecting the sample sites is 
the Uniform Random Sampling (URS), which constitutes the continuous- 
population analog to simple random sampling from a finite population [Cordy 
(1993)]. Under URS, the n sample sites are independently and uniformly se- 
lected on A [Figure 1(a)]. Despite its simplicity, URS may be not suitable 
in practice since it may produce an uneven coverage of the study region and 
the corresponding unbiased estimator of T displays a variance of order n"^, 
that is, the variance decreases to at the rate n~^ as n — > oo. In any case, 
URS is often considered a helpful benchmark to compare the performance 
of more refined schemes. 

In order to overcome the drawbacks involved with URS, a sampling scheme 
frequently adopted in environmental studies is the so-called Tessellation 
Stratified Sampling (TSS) [see, e.g., Stevens (1997) and the U.S. Environ- 
mental Protection Agency (2002), page 63]. The TSS scheme is initially 
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(b) 





(c) 



(d) 



Fig. 1. Placement of n = 25 sample sites over a study region A according to the URS 
scheme (a), the TSS scheme (b), the SGS scheme (c) and the SS scheme based on equal- 
size strata obtained by means of the Brus, Spatjens and de Gruijter (1999) algorithm (d). 
For the TSS and SGS schemes, the set R is chosen as a rectangle and the tessellation is 
based on squares, while n represents the mean number of sample sites in A. 



implemented by superimposing a suitable set R onto the study region in 
such a way that AC R and by introducing the analytical extension ye of y 
on R. Formally, the analytical extension is defined as ye{u) = y{u) if u S A 
and ydu) = ii u ^ R\A. Obviously, in this setting (1.1) may be conve- 
niently rewritten asT = Jj, ye{u) du. Subsequently, a regular tessellation of R 
is carried out and one sample site is independently and uniformly selected 
in each tessellation element [Figure 1(b)]. The theoretical properties of the 
TSS scheme have been explained in detail [Barabesi and Marcheselli (2003, 
2005a, 2005b, 2008)]. The scheme gives rise to an unbiased estimator for T 
with variance of order n~"', where 7 € (1,2]. Hence, the estimator under 
TSS is "superefficient" since its variance decreases to faster than the vari- 
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ance of the estimator under URS as n — > oo. The parameter 7 depends on 
the degree of regularity of the analytical extension yg: for instance, it turns 
out that 7 = 2 for smooth functions, but it can occur that 7 = 3/2 even 
for noncontinuous and rather irregular functions [Barabesi and Marcheselli 
(2003, 2005a, 2008)]. Hence, the variance of the estimator for T decreases 
to faster as ye becomes more regular. In addition, consistent variance es- 
timation is available if ye is a differentiable function on R [Barabesi and 
Marcheselli (2003, 2008)]. In order to avoid the difficulties involved in the 
variance estimation, Cordy and Thompson (1995) and Stevens (1997) sug- 
gest modifying the TSS scheme by randomly shifting the tessellation. The 
randomized TSS scheme does not involve extra-sampling effort and allows 
for unbiased variance estimation without any restriction on the function ye 
[Stevens (1997)], even if the consistency of the variance estimator has not 
been proved. Barabesi and Franceschi (2011) show that the TSS scheme and 
its randomized modification produce estimators for T with identical variance 
convergence rates. 

A further frequently-considered scheme is Systematic Grid Sampling 
(SGS), which constitutes a systematic version of the TSS scheme [see, e.g., 
Valentine, Affleck and Gregoire (2009) and the U.S. Environmental Protec- 
tion Agency (2002), page 70]. The SGS scheme provides the continuous- 
population analog to systematic sampling from a finite population as con- 
sidered by Madow and Madow (1944) [see also D'Orazio (2003), as to sys- 
tematic sampling of a spatial finite population]. Similarly to TSS, the SGS 
scheme requires a regular tessellation of R and the extension of the func- 
tion y on R. However, under SGS, a sample site is uniformly generated in 
the reference tessellation element and it is systematically repeated in the 
other tessellation elements [see Figure 1(c)]. The SGS scheme is commonly 
adopted in stereology and gives rise to an unbiased "superefficient" estima- 
tor for T, under certain smoothness conditions on ye [Cruz-Orive (1993), 
Baddeley and Jensen (2005), page 159]. However, the variance of the esti- 
mator tends to be extremely elevated when the periodicity in the function ye 
is "in phase" with the tessellation elements [Baddeley and Jensen (2005), 
Chapter 13]. 

Even if the TSS and SGS schemes allow for an even coverage of the study 
region and give rise to unbiased "superefficient" estimation for T, these 
schemes suffer due to two main technical drawbacks which are related to 
the analytical extension of y on R. Indeed, the sample sites are actually 
placed on R (not on A) and, hence, the number of sample sites on A is 
a random variable, unless A is exactly tessellated. Obviously, the tessella- 
tion may be selected in such a way that the mean number of sample sites 
on A equals the prefixed sample size n [as an example, this procedure is 
adopted for Figure 1(b) and (c)]. However, the task is generally undesir- 
able for field scientists, who usually require reproducible designs. Moreover, 
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even if the function y is regular on A, the function y^, is hkely to be not 
continuous on the boundary of A (and hence on R), unless y is null on 
this boundary. As previously explained, the lack of regularity considerably 
reduces the efficiency of the estimators for T. 

An alternative way to face the whole setting may be based on stratifi- 
cation methods involving the "one-per-stratum" placement of the sample 
sites. More precisely, under the "one-per-stratum" Stratified Sampling (SS), 
the study region A is partitioned into n suitable strata and one sample 
site is independently and uniformly selected in each stratum [Figure 1(d)]. 
The scheme constitutes the continuous-population counterpart to the classic 
"one-per-stratum" stratified sampling in the finite-population setting [see, 
e.g., Cochran (1946)]. Moreover, the SS scheme generalizes the TSS scheme 
when A coincides with R and the tessellation is noncongruent. The scheme 
is commonly adopted for environmental and agricultural surveys [see, e.g., 
Walvoort, Brus and de Gruiter (2010) and the references therein]. 

In this paper, it is proven that the SS scheme produces an unbiased "su- 
perefficient" estimator for T, which shares the variance properties of the 
estimator under the TSS scheme. However, in contrast with the TSS and 
SGS schemes, when the SS scheme is adopted the n sample sites are ex- 
actly placed on A and no analytical extension of y is introduced. Moreover, 
in real surveys spatial stratification is often demanded in practice, owing 
to geographical or administrative convenience and tessellation-based meth- 
ods would not be applicable. In addition, there exist ad hoc algorithms for 
partitioning the study region into strata (eventually of the same size) with 
suitable geometrical and statistical properties [Brus, Spatjens and de Grui- 
jter (1999), Walvoort, Brus and de Gruiter (2010)]. Finally, even if schemes 
with more than a single sample site per stratum may be considered, it is 
apparent that the benefits arising from the full force of the stratification 
are achieved by adopting the "one-per-stratum" allocation. In any case, the 
"two-per-stratum" scheme will be briefiy considered since it produces unbi- 
ased and consistent variance estimation. 

Even if the achieved results may be applied to a broad range of different 
data sets collected on a continuous spatial domain, the motivating practical 
setting of the paper originates from an experiment dealing with canopy cov- 
erage estimation in the Monterano Nature Reserve. Owing to the complex 
boundary mosaic of this forest, the estimation approach based on forest poly- 
gon delineation and area mensuration in the GIS environment may produce 
omission and commission errors (which tend to be systematic) in the image 
interpretation. In order to overcome these shortcomings, a survey procedure 
based on line-intercept sampling which just involves the measurement of 
the intersections of linear transects with forest patches is considered. As to 
the Monterano Nature Reserve, forest researchers collected data by placing 
transect midpoints according to the SS scheme with equal-size strata by 
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means of the Brus, Spiitjens and de Gruijter (1999). Hence, the results of 
this paper may be suitably applied in order to provide point and interval 
estimation of canopy coverage. 

2. Spatial total estimation. As pointed out in the Introduction, the bench- 
mark for comparing different schemes is the URS and, hence, spatial total 
estimation under this scheme is briefly described. liUi,U2, ■ ■ ■ ,Un are n i.i.d. 
random variables representing the sample-site locations, in such a way that 
each Ui is uniformly distributed on A, the usual unbiased estimator for T 
under URS [see, e.g., Cordy (1993)] is 

(2.1) f„=^x;^(f^^)' 

where a(-) denotes the area of a set in M^ (technically, a represents the 
Lebesgue measure in M^). The variance of the estimator in (2.1) is given by 

Var[f;] = i(a(yl)5-r2), 
n 

where S = f^y{u)'^du, and, hence, it turns out that Var[T„] = 0{n~^). As 
usual, if {an} and {6„} represent two positive sequences, a„ = 0{bn) means 
that the ratio an/bn is bounded for all n. 

Under the SS scheme, the study region A is partitioned in n strata Ai, 
A2, ■ ■ ■ ,An, in such a way that each stratum is connected and compact (in 
a topological sense), and one sample-site location is independently and uni- 
formly selected in each stratum. Therefore, let us suppose that the sample- 
site locations are represented by the random variables Vi,V2,. ■ ■ ,Vn- Ac- 
cording to the continuous Horvitz-Thompson Theorem [Cordy (1993)], an 
unbiased estimator for (1.1) under SS is 



(2.2) fn = Y,<A^)yiV^) 

i=\ 

with variance 

n n n 

al = Var[f„] = J^ a(A,)2 Var[2/(yi)] = Y. <^^^^- " E ^'' 

where Tj = J^ yiu) dn and Si = J^, y{uY dti. 

In order to assess the variance properties of the estimator in (2.2), let us 
assume that y is a Holder function on A, that is, a function satisfying the 
condition 

\y{u) — y{v)\ < H\\u — t;||", 

where H < 00, a € (0,1] and u,v £ A, while || ■ || denotes the usual norm 
in R^, that is, ||n — ujj denotes the distance between the points u and v. 
Obviously, y reduces to a Lipschitz function for the special case a = l. The 
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family of Holder functions is very large [for more details, see, e.g., Evans 
(2010), page 254]. Indeed, it is at once apparent that Holder functions are 
continuous. In addition, from the above definition, it also follows that the 
family of Holder functions contains the family of Lipschitz functions, which 
in turn contains the family of continuously differentiable functions. Infor- 
mally speaking, the Holder condition quantifies the local variation of the 
function y, in such a way that the index a may be interpreted as the corre- 
sponding "degree of local continuity." Hence, the family of Holder functions 
encompasses "smooth" functions, as well as functions displaying a very ir- 
regular behavior. As a matter of fact, there exist Holder functions which are 
continuous, but nowhere differentiable. 
By assuming that 

diam(i?) = sup ||n — f|| 

represents the diameter of a given set B, that is, the largest distance between 
two points in B, let 

dn = max diam(^j) 

j=l,2,...,n 

be the maximum diameter of the Aj's. Hence, let us consider the condition 

(2.3) dl<bn-\ 
where 6 > is a bounded constant. Since 

aiAi) <dmm{Aif <d\, i = l,2,...,n, 
condition (2.3) implies that 

a{Ai)<bn~ , i = 1,2, . . . ,n. 
In addition, let us also consider the condition 

(2.4) a{Ai)>cn~^, z = l,2,...,n, 

where c > is a bounded constant. It should be remarked that condi- 
tion (2.3) simply requires that the stratification be performed by assum- 
ing quite "homogeneous" strata, that is, avoiding strata having "stretched" 
shapes and in such a way that no "large" strata are admitted as re — > oo. 
In addition, condition (2.4) actually ensures that too "small" strata are in 
turn avoided as re — )> oo. These requirements are likely to hold for practical 
choices of ^i,^2) • • • ,An. Obviously, condition (2.4) is always satisfied with 
equal-size strata, that is, when a{Ai) = a{A)/n for i = 1, 2, . . . , re. 

On the basis of Result 1 in the Appendix, by assuming that y is a Holder 
function and that condition (2.3) holds, it turns out that 

a2=0(re-i-"). 

Hence, the SS scheme may lead to a noticeable estimation improvement with 
respect to the basic URS scheme. The best variance order re~^ is achieved 
when y is a Lipschitz function. In any case, the SS scheme produces more 
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efficient estimation with respect to URS for each a value as n — )■ oo. The gain 
may be remarkable since in many real surveys a is likely to be about one — 
for example, as to the canopy coverage estimation considered in Section 4; 
see the discussion after the formula in (4.1). 

The achieved variance properties may be extended to a larger class of 
functions. More precisely, let y be a piecewise Holder function on A, that 
is, there exists a finite partition of A in such a way that y is a Holder 
function on each partition element and the partition boundary is rectifiable, 
that is, in practical terms the boundary is "smooth." This setting is of real 
interest, since y often belongs to this function family when y represents the 
"attribute density" as defined in the Introduction. Thus, by assuming that y 
is a piecewise Holder function and that condition (2.3) holds, on the basis 
of Result 2 in the Appendix, it turns out that 

a2=0(n--i'^(i+"'3/2)). 

Hence, even if the gain is lessened owing to the discontinuity of the func- 
tion y, the performance of the SS scheme is in turn considerable. In this 
case, the best variance order n"^' ^ is achieved when y is a piecewise Holder 
function with a > 1/2. In turn, the SS scheme is preferable with respect to 
the URS scheme for each a. 

As to the large-sample normality of the estimator in (2.2), on the basis 
of Result 3 in the Appendix, by assuming that y is a Holder function and 
that conditions in (2.3) and in (2.4) hold, it follows that 

IlLZl A^ N{0,1) 

as n — 7- OO. This convergence result holds even if y is a piecewise Holder func- 
tion (see Remark 2 in the Appendix) . These findings on the variance proper- 
ties and the large-sample normality of the estimator in (2.2) are in complete 
agreement with the results obtained by Barabesi and Marcheselli (2003, 
2005a, 2008) and Barabesi and Franceschi (2011) under TSS. Indeed, the 
TSS scheme may be considered a special case of the SS scheme when A co- 
incides with R and the strata correspond to the elements of the tessellation. 
It should be finally emphasized that for each n the variance of the esti- 
mator under URS is greater than or equal to the variance of the estimator 
under SS when the strata are of the same size, that is, it holds that 

Var[7;]>Var[f;]. 

Indeed, in this case the previous inequality is verified since T = Y17=i ^* ^^^"^ 
S = Y17=i ^i^ while the inequality 

fi / n \ 



n , 



obviously holds. 
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3. Variance estimation. The estimation of a^ is not a trivial task, since 
a single observation per stratum is available and the strata generally do not 
constitute a regular tessellation, that is, the strata display different sizes and 
shapes. In such a setting, estimators relying on contrast-based techniques — 
such as the proposals by Barabesi and Marcheselli (2003, 2008) under TSS 
or the proposal by Stevens and Olsen (2003) under randomized TSS — seem 
quite difficult to implement. However, a simple estimator may be obtained by 
treating the sample as if it were obtained under the URS scheme. A similar 
procedure is suggested by Stevens and Olsen (2003) under the randomized 
TSS scheme. Hence, a naive estimator for a'^^ is given by 



2 '^ 



(3.1) at, 

n 



Since 



n—\ ^-^ \ n I 



E[a2] = a2 + B[a2], 



where 



BK1^;^E(--|)^ 

(Result 4 in the Appendix), the estimator in (3.1) is positively biased. More- 
over, if y is a Holder function and condition (2.3) holds, it follows that 

B[a2]=0(n-i) 

(Result 4 in the Appendix). Hence, even if B[a^] vanishes for large n, it 
might be of a larger order than that of a^. Moreover, if condition (2.4) 
holds, on the basis of Remark 1 in the Appendix, it promptly turns out that 

In any case, the behavior of this type of estimator seems quite stable in 
practice as emphasized by Stevens and Olsen (2003), even if its use may 
lead to a marked overestimation of a^ . 

For equal-size strata, an alternative estimator displaying more appealing 
features may be proposed. The suggested estimator is given by 

(3.2) oi = ^ Unf + J2iy(^) - y(^^+i))' + yiVnA . 



Since 



E[al] = al + B[al], 
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where 

(Result 5 in the Appendix), the estimator in (3.2) is positively biased. By 



assuming that 














Dn = max 

i=l,2,...,n- 


sup 


+1 


let us 


consider the condition 








(3.3) 






Dl 


<kn~^ 





u ■ 



with A; > a suitable bounded constant. Condition (3.3) actually requires 
that the stratification be performed by indexing the strata in such a way 
that Ai and Ai^i not be "too far" with respect to each other. In practical 
situations, Ai and Aj+i may be generally chosen as "neighbors," that is, in 
such a way that they share part of their boundary. For example, in the case 
study contained in Section 4, the partition elements are equally-sized strata 
which are indexed in such a way that the ith and the {i + l)th strata have 
a side in common (see Figure 2). If ?/ is a Holder function and conditions (2.3) 
and (3.3) are satisfied, it follows that 

B[al]=0{n~'--) 

and 



limsup 



'^'n 







(Result 5 in the Appendix). Hence, the bias order of the estimator in (3.2) 
is reduced with respect to the estimator in (3.1) and B[fT^] is of the same 
order as that of o"^. Moreover, if condition (2.4) holds, and on the basis of 
Remark 1 in the Appendix, it follows that 



t2 



0{n^~''). 



Hence, when y is a Lipschitz function, it turns out that 

Bpn] 

0<-^7^<m, 
where m > is a suitable bounded constant, while 

~2 ~2 

1 < lim inf —r: < lim sup ^ < 1 + m 

(Result 5 in the Appendix), that is, the estimator in (3.2) is large-sample 
conservative. 
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Fig. 2. The Monterano Nature Reserve and the corresponding partition m n — 50 strata 
according to the SS scheme based on equal-size strata obtained by means of the Brus, 
Spdtjens and de Gruijter (1999) algorithm. 

Finally, unbiased and consistent variance estimation is achieved if two 
sample sites are placed in each stratum, that is, if the "two-per-stratum" SS 
scheme is actually adopted. In this case, let us assume that n is even and 
that a partition of the study region into n/2 strata is carried out. Obviously, 
n is now required to be even for comparison purposes with respect to the 
"one-per-stratum" SS scheme. Moreover, let Vij and V2J represent the two 
sample sites uniformly and independently selected onto the jth stratum 
(j = 1, 2, . . . , n/2). An unbiased estimator for T is given by 

n/2 

(3.4) %,n = -Y,a{A,){y{Vi,,) + 2/(^2,,)), 



while its variance is 

al^ = Var[f2,„] = ^ g a{Ajf Var [^(^1,,)] 



n/2 2 

%/2 



2 
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Hence, an unbiased and consistent estimator for o"| „ is 

n/2 

(3.5) dl^ = - J]a(A,)2(y(li,,.) - y{V2,)f . 

i=i 

Moreover, by considering Result 1 in the Appendix, when condition (2.3) 
holds it turns out that 

4„ = 0(n-i"") 

if y is a Holder function on A, while 

4n = 0(n— -(i+"'3/2)), 

if y is a piecewise Holder function on A. However, even if the "two-per- 
stratum" SS scheme provides in turn an unbiased "superefficient" estimator 
for r, it is at once apparent that an efficiency loss occurs in using a strat- 
ification based on n/2 strata rather than n strata. In addition, if the n/2 
strata are split in such a way that each stratum is partitioned into two sub- 
strata of equal sizes and r„ is computed on the basis of this stratification, 
it is promptly shown that Var[T2^„] > Var[T„] on the basis of the discussion 
at the end of Section 2. 

4. An application to canopy coverage estimation. In order to illustrate 
an application of the SS scheme in an environmental survey, the estimation of 
the canopy coverage in the Monterano Nature Reserve has been considered. 
The Monterano Nature Reserve (which constitutes the study region A in this 
case) is located in the central part of Italy (Lazio region) and its geographical 
boundary is depicted in Figure 2. The area of the Monterano Nature Reserve 
is equal to a{A) = 1,045 ha. 

If C C ^ represents the region inside A covered by vegetation, canopy 
coverage is simply defined as the area of C, that is, in this case T = a{C). 
Canopy coverage constitutes a central indicator in forestry, as emphasized by 
Bonham (1989). In order to estimate this quantity, replicated line-intercept 
sampling is commonly adopted [Barabesi (2007), Barabesi and Marcheselli 
(2008)]. More precisely, the replicated line-intercept sampling protocol is 
carried out by selecting n sample sites on A and by considering n linear 
transects of fixed length L with the same orientation, in such a way that the 
transect midpoints are centered on each sample site. According to Barabesi 
and Marcheselli (2008), the canopy coverage T may be expressed as the 
integral of the "attribute density," that is. 



T = a{C) = \ [ l{Cnt{u))du, 
L J A 



where t{u) represents the set of points in a transect with midpoint centered 
at the sample site u, while l{-) denotes the length of a set in M (technically, / 
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represents the Lebesgue measure in M). In this case, it follows that 

y{u) = ^l{Cm{u)) 

is the length of the intersection of t{u) with C, up to a known constant. 
Hence, if the SS scheme with equal-size strata is adopted, the canopy cov- 
erage estimator reduces to 

(4.1) f„ = ^^/(Cnt(FO), 

that is, the estimator in (4.1) actually represents the total sum of the in- 
tersection lengths between C and the n transects, up to a known constant. 
Barabesi and Marcheselli (2008) remark that if the boundary of C is rectifi- 
able, y is a Holder function. Hence, if condition (2.3) holds, it turns out that 
Var[r„] = 0{n~^~"^). In particular, if C is given by the union of circles or el- 
lipses, it may be proven that Var[T'„] = 0(?i~^+'^) where e > 0. Moreover, if C 
is given by the union of polygons, it may be proven that Var[r„] = Oln'"^). 
Hence, in real settings, the estimator in (4.1) may be very efficient. 

Even if the canopy coverage could be estimated by means of polygon de- 
lineation on the basis of visual interpretation of remotely sensed imagery, 
the procedure may typically produce errors and omissions [see, e.g.. Corona, 
Chirici and Travaglini (2004)]. So, in order to avoid the interpretation draw- 
backs in the estimation of forest features such as forest ecotone or canopy 
coverage. Corona, Chirici and Travaglini (2004) suggest adopting replicated 
line-intercept sampling. Hence, for estimating canopy coverage in the Mon- 
terano Nature Reserve, the replicated line-intercept sampling protocol has 
been implemented by assuming the described procedure with n = 50 tran- 
sects with fixed direction and length L = 200 m [these choices are consistent 
with the study by Corona, Chirici and Travaglini (2004)]. The transect mid- 
points (displayed in Figure 2) have been placed by adopting the SS scheme 
with equal-size strata obtained by using the Brus, Spatjens and de Gruijter 
(1999) algorithm. In this case, the estimator in (4.1) has given rise to the 
estimate 660 ha for the canopy coverage. Hence, 63.16% of the Monterano 
Nature Reserve is covered by vegetation. In addition, variance estimation 
has been performed on the basis of (3.2) by adopting a sequential indexing 
of strata with a common side (see Figure 2). Accordingly, the standard de- 
viation estimate is 58 ha and a conservative confidence interval for canopy 
coverage at the approximate 95% confidence level is (647 ha, 674 ha). Thus, 
a conservative confidence interval at the same level for the percent coverage 
is given by (61.87%, 64.46%). 

5. Concluding remarks. Under the design-based approach, the target at- 
tribute of many surveys can be conceptualized as a suitable fixed function y 
defined on a given planar domain. This approach is usually described as the 
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continuous-population paradigm and it is especially suitable in environmen- 
tal and ecological frameworks [see, e.g., Williams and Eriksson (2002) and 
Gregoire and Valentine (2008), page 2]. Indeed, in such spatial contexts, it is 
not often possible to achieve an area frame in order to apply the usual finite- 
population sampling theory. Regrettably, practitioners frequently force the 
continuous-population setting into the finite-population setting, owing to 
the lack of results or to the misunderstanding of the continuous-population 
paradigm. 

The continuous-population paradigm requires implementation of an effec- 
tive probability sampling design to estimate the target parameter, usually 
the total of the study attribute given by the integral of the function y. 
Hence, a key decision is the choice of the sampling scheme for the placement 
of sample sites. Schemes based on tessellation and stratification are widely 
used in natural resource assessment and for environmental monitoring, since 
evenly-spread sample sites over the study region often simplify collection of 
field data and estimation efficiency is usually increased. However, as empha- 
sized by Walvoort, Brus and de Gruiter (2010), schemes based on tessellation 
methods may involve several drawbacks and, hence, stratification schemes 
may often be preferable. 

In the present paper it is shown that the "one-per-stratum" placement of 
the sample sites produces an unbiased "superefficient" spatial total estima- 
tor with respect to the uniform placement of independent sample sites. Vari- 
ance properties and convergence results for the suggested estimator are given 
in a purely design-based approach without assuming any super-population 
model on the spatial correlation structure of the target attribute, as usu- 
ally considered for systematic and stratified sampling of a two-dimensional 
population [see, e.g., Bellhouse (1977) and Breidt (1995)]. In contrast, the 
present findings are achieved by assuming very mild conditions on the func- 
tion y (which are likely to be met in any real survey) and by requiring simple 
conditions which avoid strata of too small or too large sizes, as well as strata 
with stretched shapes. 

APPENDIX 

Result 1. Let y be a Holder function. Hence, since each Ai is assumed 
to be connected and y is a continuous function, there exists Ui € Ai for each 
i = l,2, . . . ,n such that 

y{ui) = E[y{Vi)]. 

Accordingly, since the Holder condition holds for y, we obtain 

Var[y(y,)] = E[(y(y.) - E[y(y,)])'] = E[(2/(V-) - y(n,))'] 

<H^d[am{Aif''<H'^dl". 
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Hence, it holds that 



al = ^a(A,)2 Var[y(V-,)] < H'dl"^a{A,, 



i=l 1=1 

Smce a{Ai) < d^ and X]"=i ci{Ai) = a{A), it also turns out that 

al<H'dl-^'-aiA). 
In addition, if condition (2.3) holds, it follows that 

that is, 0-2 =0(n~i"°). 

Result 2. Let y be a piecewise Holder function on A. Moreover, by 
denoting B as the boundary of the partition, let us assume that 

I = {i:AinB^0}. 

Moreover, if 

M = sup \y{u)\, 

uGA 

then Ysir[y{Vi)] < M^ and it holds that 

^a(^,)2Var[y(yi)] < M^df^caid{I), 

where card(-) denotes cardinality of a set. Since B is rectifiable, it turns out 
that 

card(/) < Ci7i^/^, 

where Ci > is a suitable bounded constant. Hence, by assuming condi- 
tion (2.3), it follows that 



^a{A,fYar[y{Vi)]<b^M^C_ 



m 3/2. 



i£l 



In addition, since y is a Holder function on Ai for i ^ /, by assuming the 
achievements in Result 1, it holds that 

Y,a{AifVaT[yiVi)] < H^di+^^ cardiF). 

Moreover, it turns out that 

card(r) < Csn, 
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where C2 > is a suitable bounded constant. Thus, by assuming condi- 
tion (2.3), it fohows that 

Hence, it is finally seen that 

that is, cj2 =0(n^'^i'^(i+°'3/2)). 

Remark 1. If y is not a constant function on A and if condition (2.3) 
holds, it follows that 



liminf y Var[y(KO] > My, 



where My > is a bounded constant depending on y. Hence, from condi- 
tion (2.4) we have 

al > c^MyU-^. 

Result 3. Let y be a Holder function and let us assume that condi- 
tions (2.3) and (2.4) hold. In order to prove the large-sample normality of 
the estimator in (2.2), it suffices to verify the Lyapunov condition, that is, 

lim ^ = 0, 
where 

n 

i=l 

By assuming the notation and the findings of Result 1, it turns out that 

n 

Vn<dlY.a{A,fEMVi)-y{u,)\% 

i=l 

Moreover, since the Holder condition holds for y, it also follows that 



n 

< Hdl^'^ Y. «(^^)' Var [y(y.)] = Hdl+^al 



1=1 
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and, hence, 

Thus, on the basis of condition (2.3) and Remark 1, it follows that 
and, hence, the Lyapunov condition is proven. 



Remark 2. The large-sample normality of the estimator in (2.2) may be 
proven even if y is a piecewise Holder function and conditions (2.3) and (2.4) 
hold. This result may be shown in a general setting by verifying the Raikov 
condition. 

Result 4. Since the estimator in (3.1) may be rewritten as 



a. 



i=l 



it follows that 



n 

E[^n] = ^5]a(A,)2E[y(V-)2] - ^^3yE[f2] 



= +r=) = <rJ + B[s;], 



n 

On the basis of the notation and the findings of Result 1, since 

T,<Ma{Ai), 
where M is defined in Result 2, it turns out that 



n 



2^/ A\^2 I oT^2^ — 1 



< 2M^a{A)dt, + 2T'n 
Hence, if condition (2.3) holds, it follows that 



^[Ti--] <2{bM^a{A)+T^)n 

i=l 



and, hence, B[ct^] = 0{n ^). 
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Result 5. Since the estimator in (3.2) may be rewritten as 

2 / n n— 1 



al 



/' A\Z I I li -L 

it follows that 



\i=l i=l 



n n— 1 

Moreover, if y is a Holder function, we have 



(T, - r,+i)2 < ^(2/(n,) - y(n,+i))^ 



Hence, if condition (3.3) holds, it turns out that 

[Ti - Ti+if < A;"//2^(A)2?i-2-°. 

Thus, it follows that 

B[a2] < M^a{Afn~^ + k''H^a{Afn^^~'', 

where M is defined in Result 2, and, hence, B[a^] = 0{n~^~"). Moreover, if 
condition (2.4) holds, owing to Remark 1, we obtain 

n>l V "n / n>l '^ 



4 



2l\4i 



y n>l 

since 

where C3 > is a suitable bounded constant. Hence, it follows that 

lim""-f"'=0. 

n-s>oo (T„ 
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